PENTAGON AND HEXAGON EQUATIONS FOLLOWING FURUSHO 



DROR BAR-NATAN AND ZSUZSANNA DANCSO 



Abstract. In [F] H. Furusho proves the beautiful result that of the three defining equations 
for associators, the pentagon implies the two hexagons (see also |W]). In this note we 
present a simpler proof for this theorem (although our paper is less dense, and hence only 
slightly shorter). In particular, we package the use of algebraic geometry and Grocthcndicck- 
Teichmuller groups into a useful and previously known principle, and, less significantly, we 
eliminate the use of spherical braids. 
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1. Introduction 

Associators are useful and intricate gadgets that were first introduced and studied by 
Drinfel'd in |Drl] and |Dr2] . The theory was later put in the context of parenthesized (a.k.a. 
non-associative) braids by |LM] and |BNj . Associators arise in several different areas of 
mathematics, and thus constructing an associator is a task faced by many. Unfortunately, 
it is a very difficult task: no closed formulas are known at present. 

Associators are essentially the solutions to three equations, called the pentagon and the 
positive and negative hexagons, which live in complicated diagrammatic spaces. Finding an 
associator amounts to finding a solution to this system of equations. Furusho's result states 
that the last two of these three equations are superfluous: a solution to the pentagon will 
automatically be a solution to both hexagons. This statement is quite surprising, and thus 
we felt that a simpler proof would be of value. 

The paper is organized as follows: we first review some definitions, then present the main 
tool or "extension principle" for the simplified proof: a theorem from |Dr2] and |BNj . and 
also a simpler standard fact which we call the "linearization principle" . We then prove the 
theorem modulo a "Lie-algebraic" version which we call the "Main Lemma" , followed by the 
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proof of the Main Lemma, and finally a side note on how one of the algebraic maps used in 
the proof arises from topology. 

The part of the proof which depends on the extension principle is significantly different 
from that of [F] (and [W]), and significantly simpler: Furusho's proof uses algebraic geom- 
etry and Groethendieck-Teichmuller groups to go from the Lie algebra version to the group 
statement. 

The proof of the main lemma is essentially Furusho's proof, except for the elimination 
of the use of spherical braids. Using spherical braids makes the statement of the Main 
Lemma look very symmetric, so in some sense prettier. However, in our opinion, it makes 
the combinatorial argument of the proof less transparent: it is the break of symmetry which 
helps the reader figure out how someone might have discovered the proof. Also, we save the 
trouble of having to pass back and forth between the spherical and regular chord-diagram 
spaces. The elimination of spherical braids is easy: the space of chord diagrams of pure 
spherical 5-braids and that of regular pure 4-braids only differ in taking a quotient by the 
center of the latter. 

1.1. Acknowledgements. The authors would like to thank the referee for the thorough 
reading of the paper and detailed helpful suggestions, and Stavros Garoufalidis for an en- 
lightening discussion of the proof of the Main Lemma. 



2. Definitions 

Algebraically, the space of chord diagrams of pure n-braids, A n , is defined to be the graded 
completion of the following non-associative algebra, where generation is understood over a 
fixed field k: 



1 < i ^ j < n 



tij Ej'i) 



[tij,tki\ = when i,j,k,l are distinct (L), 
[ty + t ik , tjk] = when i, j, k are distinct (4T) 

Here, 4T is short for four term relations, and L stands for locality. 

One usually thinks of A n in terms of horizontal chord diagrams on n strands, where t^ 
is represented by a chord between strands i and j, and multiplication is done by stacking 
diagrams: 



1 2 




i j k I 



i j k l 



+ 



+ 



{AT) 



Let us denote by the free lie algebra over a field k of characteristic 0, on two generators 
X and Y, and by UT2 its universal enveloping algebra, which is isomorphic to k{(X,Y)), 
the algebra of non-commutative power series over k. By a superscript (m), as in UJ 7 ^, 
we mean the space or object modulo degree (m + 1), e.g. non- commutative polynomials of 
degree up to m. 
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There is a co-product on UT 2 , defined by A(X) = 1 <g> X + X ® 1, A(Y) = 1 ® y + y <g> 1. 
An element $ is called group-like if A($) = $ ® $. An element </? is primitive (meaning it 
is a Lie-algebra element) if A(</?) = 1 ® + ® 1. 

Now let us present the main characters of this story: the pentagon and hexagon equations. 

The pentagon equation originates from the fact that in a non-associative algebra, there are 
five ways to multiply four elements, and these are connected by a pentagon of re-associations. 
This fact and its parenthesized braid analogue are shown in the figure below: 

12(34^ (••) (••) ^(12)34 

•(•(••)) ((••)•)• 

234 \^ ^123 

The above equality of parenthesized braids implies an algebraic equation in A4 called the 
pentagon equation, P($) = 1, for $ G UF 2 , where 

P($) = $(t 13 + t 23 , t34) _1 $(tl2, t 23 + t24)~ 1 ®(t 23 , t U )$(t 12 + *13, *24 + t 34 )$(t 12 , t 23 ). 

The hexagons arise from the following equivalences of parenthesized braids: 



Algebraically, the implied equations in *4 3 are H±(<&) = 1, for $ G UJ r 2 , where 

H ± (*) = e (t-^^) Ht 13 ,t 12 )e Ht^t^e (±*£\ $(t l2 ,h 3 ). 

Here, e(x) := e x . 

An associator is a group-like element of UT 2 which satisfies the pentagon and hexagon 
equations. 

Note that the spaces A 3 and 1AT 2 are almost isomorphic: 1AT 2 = A 3 /Z(A 3 ) is the quotient 
of A 3 by its center Z(A 3 ) = (t 12 + t 23 + t 31 ). An associator is sometimes defined as an 
element of A 3 , which is equivalent, since associators are commutator-grouplike, meaning 
their abelianization is 1. (This follows from the pentagon equation.) 

3. Theorem and Proof 
In [F] Furusho proves the following surprising result: 

Theorem 3.1. If $ is a group-like element oflATi with c 2 (&) = where c 2 (&) denotes 
the coefficient of XY in and $ satisfies the pentagon equation P(<&) — 1 in Aa, then $ 
satisfies the hexagon equations if±($) = 1 in A 3 , and therefore $ is an associator. 
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To prove Theorem 13.11 we need two tools, which we will call extension and linearization 
principles, and a major lemma, Lemma [3.21 below. 

The "extension principle" is the assertion that an associator modulo degree m can be 
lifted to an associator modulo degree (m + 1). This statement is highly non-obvious; it takes 
up a good part of [Dr2j and the main part of |BNj . Its main application is the fact that 
non-trivial rational associators exist: indeed, up to degree 2 a direct computation shows that 
P($) = 1 and P±($) = 1 have a unique non-trivial solution (namely, $^ = 1 + -^[X, Y]), 
and then by extension, one may construct a rational associator inductively. 

The "linearization principle" is the standard fact that the pentagon P($) = 1 and the 
hexagons P±($) = 1 can be "linearized". Precisely, this means that there exist degree- 
preserving linear operators dP : UF 2 —tAt and dH + = dH^ = dH : lATi — > A 3 so that if 
$ and $' satisfy the pentagon equation modulo degree m, i.e. P($) = P($') = 1 modulo 
degree m, and are equal modulo degree m, i.e. ip := ($ — $')( m ) is homogeneous of degree 
m, then 

P($) - P($') = dP(p) modulo degree (m + 1) 

Likewise for the hexagons: if $ and $' satisfy one of the hexagons modulo degree m, and 
are equal modulo degree m, then 

P±($) - H±(&) = dH(p) modulo degree (m + 1), 

where <p is as above. Note that if $ and $' are group-like, then p is primitive. 
To prove the theorem we use the following main lemma: 

Lemma 3.2. If tp e 1AT 2 is homogeneous of degree m > 3 and is primitive, and satisfies 
the linearized pentagon equation dP(p) = , then dH(p) = 0, in other words the linearized 
pentagon equation implies the linearized hexagon equation. 

Proof of Theorem 13.11 from Lemma 13.21 Let us assume that $ is as in Theorem 13.11 
in particular P($) = 1, and, by contradiction, that one of the hexagons fails to hold, i.e. 
H±(<&) 7^ 1. Let m be the minimal degree in which this happens. 

By the simple computation in low degrees mentioned before, we know that m > 3. Note 
that by the minimality of m, $ satisfies the hexagons modulo degree m and hence it is an 
associator modulo degree m. By extension, there exists a $' G UTi which agrees with $ 
modulo degree m, and which satisfies both the pentagon and the hexagon equations modulo 
degree (m + 1). Let <p — (<& — $')( m ), which is homogeneous of degree m since $ = $' modulo 
degree m. 

By linearization, dP(p) = P($) — P($') = modulo degree (m + 1), so by Lemma [3.21 
P±($) — H±(Q') = dH(p) = modulo degree (m + 1). Back again by linearization, as $' 
satisfies the hexagons in degree m, it follows that so does <3>, contradicting our pessimistic 
assumption from the beginning. □ 

4. Proof of the main lemma 

By explicit computations, 
dP(p) = -p(t 12 , t 23 + t 2i ) - p(t 13 + t 23 , t u ) + p(t 23 , t u ) + p{t 12 + t i3 , t 2 4 + hi) + ¥>(*12, ^23), 

dH(p) = p(t 13 , t 12 ) - p(t 13 , t 23 ) + <p(t 12 , t 23 ). 

Let us start by proving two simple but necessary lemmas. Throughout this section, p is 
assumed to be a primitive, homogeneous element of UJ r {2) of degree > 3, as in Lemma [3.21 
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Lemma 4.1. dP(ip) = implies that if is anti- symmetric, i.e. ip(X,Y) + ip(Y,X) = 0. 



Proof. We use the map q : *A 4 — > T 2 denned by: t 12 h-> X, t 23 ^ Y) 
ti 3 H- (— X— y), tu H- Y, t 24 (— X — Y), and t 34 i-> X, as illustrated 
by the figure on the right. Since q(dP(ip)) = ip(X,Y) + <f(Y,X), the 
lemma follows. □ 

Note that by the anti-symmetry of f, we have: 

dP(f) = lf(t 12 , t 23 ) + <f(tu, *13 + <23) + ¥>(*23 + <24, *12) + ^23, ^34) + <f{tl2 + *13, ^24 + t U ), 

dH(ip) = lf(t l2 , t 23 ) + lf(t 23 , t 31 ) + (p{t 3 x,t 12 ). 

Lemma 4.2. The linearized hexagon dH(ip) = is equivalent to the 
equation 

(1) <f(X, Y) + ip(Y, -X-Y) + ip(-X -Y,X) = 

in UT 2 , if <f is of degree > 2. 

By an abuse of notation, we shall denote the left side of equation ([T]) by dH(X, Y). 

Proof. dH(<f) = implies ([T]) via the quotient map 7r : A 3 — > UT 2 which factors out by the 
center of A 3 , defined by 7r(ti 2 ) = X, n(t 23 ) = Y, and vr(t 13 ) = —X — Y. 

For the other direction we apply the map i : IAT 2 — > A 3 given by i(X) = t± 2 and i(Y) = t 23 
i(dH(X, Y)) = <f(t 12 , t 23 ) + <f(t 23 , -t 12 - t 23 ) + <f(-t 12 - t 23 , t 12 ) = <f(t 12 , t 23 ) + ip(t 23 , t 13 ) + 
<f(ti3,ti2) = dH(ip), where the second equality is due to the fact that if G \F 2 ,T 2 \ (since it 
is primitive of degree > 2), and that the element t\ 2 + t 23 + £13 is central in A 3 . □ 

Proof of the Main Lemma 13.21 Let us first introduce some notation. Any map of sets 
/ : [r] — > [s], where [r] := {1,2, ...,r} and [s] = {1,2, s}, induces a map / : A s — > A r , 
where f(Uj) := XLe/- 1 ^) /Se/- 1 ^') ^ either of the pre-images is empty, we understand 
the sum to be zero. 

Now, if i/)(X,Y) G UT 2 , denote ^(123) := ^(t X2 ,t 23 ) G A 3 ; and let 

'.•((/,.../,)(./,..,/„)(/.•;.../.•,)) := ^(^(123)), 

where g : [t + u + v] — >■ [3] is given by g~ 1 (l) = {i\...it} and similarly for 2 and 3. 

The permutation group S4 acts on A4 by commuting the strands (i.e. the indices). For 
any a G 1S4, we know that cr(dP(<f)) = 0. We try to find a (small) set of permutations <7j 
such that = J2i cr idP( l f) = 9j(dH(ip)), for some gj : [4] — > [3]. 

In fact, the following four permutations work (the notation a = XiX 2 x 3 Xa means o~(i) = Xi 
for i = 1,2,3,4): 

Gl = id, a 2 = 4231, a 3 = 1342, a 4 = 4312 
. Using the notation introduced above, 

= £li o-tdPiif) = 

= if (123) + ^(43{12)J+ ^((34)21) + (^(234) + ^(l(23)4j+ 
+^(423) + ^(13{42)X+ <f ((31)24) + ^(231) + ^£4(23)17+ 
+(^(4347+ v?(24(13)) + <^(42)3IX+ ^(342) + ^(1(34)2) + 
+ifJ^my+ ^(21(43)) + ifJJ^2jM) + f(312) + ^(4(31)2) = 
= dH (123) + dH ((34)21) + dH(423) + dH ((31)24), 
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where the cancellations are by the anti-symmetry of p (Lemma 14. ip . 

Note that every chord appearing on the right side ends on strand 2, i.e., dH (123) + 
dH ((34)21) + dH (423) + dH ((31)24) e (ti 2 ,t 23 ,t 2 ±) . Also, (t 12 ,t 2 3,tu) = ^3 Q A, since 
there are no relations in A4 that involve only these elements. 

Note that the anti-symmetry of tp also implies the anti-symmetry of dH, i.e. dH(X, Y) = 
—dH(Y,X), and in particular dH(X,X) = 0. We can now finish the proof using two 
projections: 

Let pi : T3 — > T 2 be the projection defined by t\ 2 (-)■ X, t 23 !->■ Y, t 2 4 \-> X, and 
apply this to the equality = dH(123) + dH((34)21) + dH(423) + dH((31)24). We obtain 
= dH(X, Y) + dH(X, Y) + dH(X + Y,X) + dH(X + Y,X), and therefore dH(X + Y,X) = 
-dH(X,Y). 

Now do the same for the projection p 2 defined by t\ 2 (-)■ X, t 23 \-> X, t 2i \-> Y. We get 
= dH(X, X) + dH(Y, X) + dH(X + Y, X) + dH(2X, Y), and so using the above, we arrive 
to dH(2X,Y) = 2dH(X,Y). 

This means that dH(X, Y) contains only commutators that involve one X and some 
number of Y's, and so writing dH in a linear basis we have: 

dH(X,Y) = Y.n=i a n(adYy-\X). 

Since dH(X, Y) = —dH(Y, X), we know that a n = for all n except possibly n — 2, and 
a 2 = by the assumption that ip is of degree > 3. Thus, dH(X, Y) = 0, which is equivalent 
to the Main Lemma by Lemma 14.21 □ 

A surprising moment in the proof above is the equality relating a sum of four linearized 
pentagon equations to a sum of linearized hexagons. As pointed out by Stavros Garoufalidis, 
there is a similar but more natural equality which, as the reader can check, arises from the 
permuto-associahedron on page 21 of [BNj : 

dP (1234) -dP (1243) +dP (1423) -dP (4123) = dH '(34(12)) -dH '((23)41) +dH (241) -dH (342) 

It is easy to verify that the rest of the proof goes through the same way using this equtlity 
in place of the one above. 

5. A SIDE NOTE 

The projection q, used in the proof of Lemma [4.11 has an interesting property and a nice 
topological interpretation. 

If we embed *4.3 in A4 on any three strands (i.e. by any embedding of the index set [3] 
into [4]), and then apply q, we get the "almost isomorphism" between A3 and JF 2 , i.e. the 
composition is factoring out by the center of ^.3: 

A3 

^^^^^mod out by center = 

^ *ia + t2z + h 3 ) 
embed on any ^^^^^(almost an isom) 
3 strands ^^^^^^ 

^4 q >^2 

This is a braid theoretic analogue of the following fact about the symmetric group £4. 
Since S4 is isomorphic to the group of symmetries of the tetrahedron, and each element 
of £4 also permutes the three pairs of opposite edges of the tetrahedron, we obtain a map 
p : £4 — > S3. Pre-composing this map with any embedding of £3 into £4 induced by an 
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embedding of the set [3] into [4], we get an automorphism of S3, namely, an isomorphism 
from S3 to the group of symmetries of a face: 



S4 permutes sets of opposite edges 
map p : S4 — > S3 



Topologically, q : A4 — > Ti is induced by a map q, defined as follows: 

P B i — - — > spBi— — > pB 3 / M i twist 







Here, pBi denotes the pure braid group on i strands, and spB^ denotes the 
group of pure spherical braids on four strands. Spherical braids live in S 2 x I, 
as opposed to D 2 x J, which means that one strand wrapping all the way 
around the others is trivial, as shown for strand 1 on the right, and similarly for all other 
strands. This defines the quotient map q~\ above. 

For q 2 , take any spherical braid on four strands, pull the last strand straight, I 1 
and consider the first three strands as a braid in the complement of strand 4. 
The target space of this map is the group of regular pure 3-braids factored out 
by the full twist of the three strands: pull the 4-th strand straight on the left side of the 
spherical relation shown in the figure on the right, and observe that what we get is a full 
twist of the first 3 strands, which then has to be trivial in the image. 

Note that the chord diagram map induced by factoring out by the full twist in pB% is 
exactly the quotient map ir : A3 — > Ti which sends tyi + £23 + ^13 to 0. 
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